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Abstract 

Background: Relationships between improvements in lung function and other clinical outcomes in chronic 
obstructive pulmonary disease (COPD) are not documented extensively. We examined whether changes in trough 
forced expiratory volume in 1 second (FEV^ are correlated with changes in patient-reported outcomes. 

Methods: Pooled data from three indacaterol studies (n = 3313) were analysed. Means and responder rates for 
outcomes including change from baseline in Transition Dyspnoea Index (TDI), St. George's Respiratory 
Questionnaire (SGRQ) scores (at 12, 26 and 52 weeks), and COPD exacerbation frequency (rate/year) were tabulated 
across categories of AFEVV Also, generalised linear modelling was performed adjusting for covariates such as 
baseline severity and inhaled corticosteroid use. 

Results: With increasing positive AFEV l7 TDI and ASGRQ improved at all timepoints, exacerbation rate over the 
study duration declined (P < 0.001). Individual-level correlations were 0.03-0.18, but cohort-level correlations were 
0.79-0.95. At 26 weeks, a 100 ml increase in FEVq was associated with improved TDI (0.46 units), ASGRQ (1.3-1.9 
points) and exacerbation rate (12% decrease). Overall, adjustments for baseline covariates had little impact on the 
relationship between AFEV^ and outcomes. 

Conclusions: These results suggest that larger improvements in FEVq are likely to be associated with larger patient- 
reported benefits across a range of clinical outcomes. 

Trial Registration: ClinicalTrials.gov NCT00393458, NCT00463567, and NCT00624286 
Keywords: COPD, spirometry, FEV 1# health status, dyspnoea 



Introduction patient. Decision-makers also require evidence to assess 

In the absence of other widely accepted and validated trends across large cohorts of patients, 

markers for chronic obstructive pulmonary disease Several studies have demonstrated a significant rela- 

(COPD), lung function measurement, specifically forced tionship between poor lung function and worsened 

expiratory volume in 1 second (FEV!), has been used as health and economic outcomes in patients with COPD 

a global marker for pathophysiological changes [1] and [2-13], but few have investigated whether changes in 

by regulators in the drug approval process. Conse- lung function associated with an intervention are corre- 

quently, clinical trials for new products in COPD are lated with changes in such endpoints [12-15]. There is 

typically powered to demonstrate significant improve- good evidence that declining lung function leads to wor- 

ments in FEVi. However, healthcare professionals are sened patient outcomes, but a surprising lack of evi- 

more likely to be interested in improvements in patient- dence that improvements in lung function are correlated 

reported outcomes such as symptoms and health status, with improvements in symptomatic outcomes, 

which may better reflect treatment impact on the Indacaterol is a novel, inhaled, ultra-long-acting p 2 - 

agonist. Initial Phase III trials included over 3000 

* Correspondence: pjones@sgul.ac.uk patients, providing a large pooled dataset. We analysed 

division of Clinical Science, St George's, University of London, London, UK this rfataset in order to examine the relationships 
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between change in FEV! and outcomes including dys- 
pnoea, health status, exacerbations and rescue medica- 
tion use. 

Methods 

Study design and treatments 

This investigation was a pooled analysis of patient-level 
data from three Phase III, randomised studies: Study 1 
(INVOLVE [INdacaterol: Value in COPD: Longer term 
Validation of Efficacy and safety]) was a double-blind 
comparison of indacaterol 300 (ig or 600 (ig once daily 
with formoterol 12 |ig twice daily and placebo for 52 
weeks; Study 2 (INHANCE [INdacaterol versus tiotro- 
pium to Help Achieve New COPD treatment Excel- 
lence]) compared double-blind indacaterol 150 (ig or 
300 (ig once daily with placebo and open-label tiotro- 
pium 18 (ig once daily for 26 weeks; Study 3 (INLIGHT 
1 [INdacaterol: efficacy evaLuation usInG 150 |ig doses 
witH COPD PatienTs]) was a 12-week study comparing 
double-blind indacaterol 150 (ig once daily with placebo 
for 12 weeks. Patients were permitted to continue 
inhaled corticosteroid (ICS) monotherapy if the dose 
and regimen were stable for 1 month before screening, 
and were to remain stable throughout the study; 
patients were also permitted rescue salbutamol as 
needed. Full details have been reported elsewhere 
[16-18]. All studies were conducted in accordance with 
the Declaration of Helsinki (1989) and local applicable 
laws and regulations. Approval was obtained from the 
Institutional Review Board or Independent Ethics Com- 
mittee of each participating study centre. All patients 
provided written informed consent prior to participating 
in each study included in the pooled analysis. All patient 
data was anonymised. 

Patients 

Patients were male or female, aged > 40 years, with a 
smoking history of > 20 pack years and a diagnosis of 
moderate-to-severe COPD [19]. All patients in whom 
trough FEVx measurements were available both at base- 
line and at 12 weeks were included. Patients with 
extreme changes from baseline in trough FEVl (> +500 
or < -500 ml) were excluded, as these values were con- 
sidered erroneous. 

Endpoints 

The primary endpoint in all three studies was trough 
FEVx (average of the 23 h 10 min and 23 h 45 min 
post-dose value) after 12 weeks of treatment. Trough 
FEV! at baseline was defined as the average of the FEVi 
values 50 and 15 min prior to the first dose of study 
drug. Trough YEVi was assessed at the end of Weeks 4, 
8 and 12 in all studies, Weeks 16, 20, 24, 28, 36, 44 and 
52 in Study 1, and Weeks 16, 21 and 26 in Study 2. In 



all studies, spirometry equipment and performance of 
spirometric testing was required to be in accordance 
with ATS/ERS standards [20]. 

Secondary endpoints included health status (using the 
St George's Respiratory Questionnaire [SGRQ] [21]), and 
dyspnoea (using the Transition Dyspnoea Index [TDI] 
[22]; Studies 1 and 2 only). The SGRQ provides scores 
between 0 and 100, with higher values indicating greater 
impairment. The TDI is inherently a change from base- 
line and provides values between -9 and +9, with positive 
values indicating improvement. Rescue medication use 
(number of puffs of salbutamol) was recorded by patients 
in diaries. COPD exacerbations were defined as the onset 
or worsening of > 1 respiratory symptom for > 3 conse- 
cutive days, requiring intensified treatment (e.g. systemic 
steroids, antibiotics, oxygen) and/or hospitalisation or 
emergency room visit. Severe exacerbations were those 
requiring hospitalisation. 

Statistical methods 

The primary objective was to examine relationships 
between patient-reported outcomes and change from 
baseline in trough FEV! (AFEV^ using data summarisa- 
tion and model-based analysis. Outcome variables for 
both analysis approaches were TDI, change from base- 
line in SGRQ (ASGRQ), rescue medication use and 
exacerbation rates. 

For TDI and ASGRQ, relationships were examined 
with the average of each patient's AFEVx through the 
corresponding week of observation. For rescue medica- 
tion use and exacerbations, the average AFEVi over 
time on treatment was used. 
Data summaries and related inferences 
TDI and ASGRQ were handled as outcome variables at 
12, 24/26 and 52 weeks. Responders were patients who 
achieved at least the minimal clinically important differ- 
ence (MCID) from baseline (one and four units for TDI 
and SGRQ, respectively [21,22]). Daily rescue medica- 
tion use was the number of puffs during treatment 
divided by the number of days on treatment. Rate of 
exacerbations was the number of exacerbations on treat- 
ment, normalised to 1 year (365 x number of exacerba- 
tions while on treatment/ days on treatment). 

For each of the timepoints, outcomes and responder 
rates for ASGRQ and TDI were tabulated across five 
categories of AFEVx that were chosen to distribute 
patients approximately equally across categories, and 
bounded above and below by ± 500 ml. The hypothesis 
of equality across categories was tested by the Kruskal- 
Wallis test. Correlation coefficients were computed 
between observed individual values of AFEVx and the 
outcome, and between the category midpoint values of 
AFEVi (-275 ml, 0 ml, 100 ml, 200 ml and 375 ml) and 
the category mean response of the outcome. 
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Model-based analyses 

In line with established statistical procedures, general- 
ised linear modelling [23,24] was performed to examine 
the relationship between AFEVi and each outcome vari- 
able. For TDI and ASGRQ, observations at all time- 
points were modelled together using repeated-measures 
multiple regression analyses, assuming constant variance 
and an unstructured correlation matrix. Time was 
included both as a main effect and in an interaction 
with AFEVi. 

Rescue medication use and exacerbations were mod- 
elled as number of puffs and number of exacerbations, 
respectively, during time on treatment. Rescue medica- 
tion use was modelled using the zero-inflated negative 
binomial distribution for likelihood-based model build- 
ing, and then the final model was refitted using quasi- 
likelihood to report parameter estimates. Exacerbations 
were modelled using the negative binomial distribution. 
For both, in order to ensure positivity of the modelled 
mean response, the logarithm of the mean was repre- 
sented as linear in the covariates, and then the mean 
was found by taking antilogs. 

Other predictor variables were baseline trough FEVi 
(continuous), age (continuous), gender (binary), ICS use 
(binary: yes or no), treatment (indacaterol, formoterol, 
tiotropium or placebo), screening FEVi measured to 
assess reversibility before and after a short-acting p 2 - 
agonist, and before and after a short-acting anticholiner- 
gic, world region (Western Europe and the USA, East- 
ern Europe and Turkey, Rest-of- World), and time at 
risk for exacerbations and rescue medication use. Dis- 
ease severity was included as a binary variable, based on 
the Global initiative for chronic Obstructive Lung Dis- 
ease (GOLD) stages [19]; predominantly GOLD 2 (mod- 
erate or less severity including 91% moderate, referred 
to subsequently as GOLD 2) versus predominantly 
GOLD 3 (severe or greater severity including 98% 
severe, referred to as GOLD 3), as measured by per cent 
predicted FEVi at screening after short-acting p 2 -ago- 
nist. The default condition for all models was: baseline 
FEVi of 1.3 1, age 65 years, gender male, GOLD 2, no 
ICS use, indacaterol treatment, screening FEVi before 
(and after) reversibility testing of 1.3 1 (1.5 1) and Wes- 
tern Europe/USA region. All statistical comparisons 
were made relative to this combination of covariates, 
and unless otherwise stated, these were the values of the 
parameters used for predictions by the models. 

Model-based inference steps were performed to test 
for interactions between AFEVi and the covariates treat- 
ment, disease severity, ICS use and world region. For 
this purpose, disease severity was represented jointly by 
baseline FEVi, the binary severity indicator defined 
above, Baseline Dyspnoea Index (BDI) for TDI and base- 
line SGRQ for ASGRQ. To allow for the possibility of 



differing relationships for negative versus positive values 
of AFEVi, a possible breakpoint at AFEVi = 0 was 
tested in each model. The main effects of covariates 
were tested for significance according to Wald P values 
in the final model, with P < 0.01 judged significant with- 
out any adjustments for multiplicity. 

For each outcome variable, the improvement in 
expected response for an increase in EEVi from 0 to 
100 ml was also computed, based on a model that 
excluded treatment effects, to allow for variation in 
AFEVi between, as well as within, treatments. 

Results 

In total, 3313 patients were included in the analysis. 
Patient demographic and clinical characteristics are pre- 
sented in Table 1. Age, pre- and post-bronchodilator 
FEVi and body mass index were well balanced across 
studies. Study 1 included more males, patients taking 
ICS and patients with slightly lower per cent predicted 
FEVi and reversibility. 

Data summaries and related inferences 

The distribution of average AFEVi responses by time- 
point is shown in Table 2, both as frequencies within 
AFEVi categories and as percentiles of distributions. 
Median values ranged from 75 to 94 ml. Approximately 
5% of observations were excluded due to extreme 
AFEVi (± 500 ml) at any timepoint; 0.7% observations 
(24/3313) were less than -500 ml and 4.1% (137/3313) 
were greater than 500 ml at Week 12. 

All relationships between AFEVi an d outcomes were 
statistically significant, except for severe exacerbations 
(Table 3). Individual-level correlations were weak (0.03- 
0.18), reflecting the large variability in outcomes; how- 
ever, cohort-level correlations were stronger (0.79-0.95). 
When outcome means were plotted versus AFEVi mid- 
points, there were clear trends towards greater improve- 
ment in outcomes with increasing AFEVi, particularly 
for positive AFEVi (Figure 1). Responder rates, in terms 
of TDI and ASGRQ, followed a similar pattern to the 
mean outcomes (Table 4). 

Model-based results 

The plots of curves fitted from the model-based analysis 
for each outcome variable versus AFEVi are presented 
in Figures 2a-d. For TDI and ASGRQ, the significant 
breakpoints at zero are evident in the changes of slope 
in the fitted lines. For rescue medication and exacerba- 
tions the breakpoints were not significant. The fitted 
curves for rescue medication and exacerbations are lin- 
ear on logarithmic scales, so appear nonlinear on the 
scales of these plots. 

AFEVi was significantly correlated with TDI score (P 
< 0.0001). A significant breakpoint in the fitted lines is 
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Table 1 Baseline characteristics of study participants included in the analysis 





jlUUy 1 MM VULVL [ 1 OJ 


<itiiH\/ 9 INHANrF T171 


<;tuHv 3 ini \c\-\t i riai 

jiuuy j iiMLivjni i [ i oj 


Total 


n 


1 377 


1 575 


361 


331 3 


Age, years 


64 (8) 


64 (9) 


63 (10) 


64 (9) 


iviaie/temaie, % 


78/22 


63/37 


52/48 


69/31 


Body mass index, kg/m 2 


II (5) 


2/ (6) 


OQ t~7\ 

28 {/) 


0"7 (£\ 

1/ (b) 


rbv-i, % predicted 


53 (14) 


56 (14) 


55 (14) 


55 (14) 


rn/ irxic 0/ * 


51 (10) 


53 (10) 


53 (10) 


52 (10) 


Pre-bronchodilator FEV-|, 1 


1 ir\ ao\ 
I .3d {UA3) 


1 oo tr\ acw 
1 .33 (U.4yj 


1 .34 (0.5 I J 


\ .34 (U.4/J 


Post-bronchodilator FEV-| , 1* 


I .52 (U.4/J 


I .d(J (U.dUJ 


i n (r\ r -)\ 

1 .5 1 (U.52J 


1 .5 1 (,U.4yj 


Reversibility, %* 


1 5.2 (J 5.4) 


15.5 (15.9) 


lo.U (lo./J 


1 4.0 (, I b.ij 


ICS use yes/no, % 


55/45 


38/62 


32/68 


45/55 


Smoker/ex-smoker, % 


40/60 


44/56 


52/48 


43/57 


dUI score 


6.6 (2.2) 


6.5 (2.3) 


NA 


6.5 (2.2) 


SGRQ total score 


44 (18) 


45 (18) 


49 (19) 


45 (18) 


Treatments 










Placebo, n 


322 


311 


176 


809 


Indacaterol 75 ug, n 


0 


67 


0 


67 


Indacaterol 150 ug, n 


0 


346 


185 


531 


Indacaterol 300 ug, n 


363 


357 


0 


720 


Indacaterol 600 ug, n 


344 


68 


0 


412 


Formoterol 12 ug, n 


348 


75 


0 


423 


Tiotropium 18 ug, n 


0 


351 


0 


351 



Data are mean (standard deviation) unless otherwise stated. ^Measured 30 min after salbutamol 400 ug inhalation. Reversibility was calculated as the difference 
between the pre- and post-bronchodilator values of FEVt (in I) as a percentage of the pre-bronchodilator value; BDI, Baseline Dyspnoea Index; FEV^ forced 
expiratory volume in 1 second; FVC, forced vital capacity; ICS, inhaled corticosteroid; INVOLVE, INdacaterol: Value in COPD: Longer term Validation of Efficacy and 
safety; INHANCE, INdacaterol versus tiotropium to Help Achieve New COPD treatment Excellence; INLIGHT, INdacaterol: efficacy evaLuation usInG 150 ug doses 
with COPD PatienTs; NA, not available; SGRQ, St George's Respiratory Questionnaire. 



seen at zero; the slope was significantly shallower for 
negative AFEVi compared with positive A¥EV 1 (P = 
0.003 for the difference between slopes). The slope of 
the relationship (determining the magnitude of change 
in outcome for a given improvement in AYEVi) was not 
significantly affected by treatment, baseline severity, ICS 
use or world region. Hence, the overall model-predicted 
increase in TDI for a 100 ml increase in AFEVi was the 
same for all combinations of covariates, and estimated 
to be 0.46 at Week 24/26. Although the slope of the 
relationship with AFEVi was the same for all covariates, 
the intercept, that is, the TDI corresponding to zero 
change in FEVi, was not. For a given AFEVi, patients 
with lower baseline FEVi, lower BDI, using ICS or on 



placebo, had significantly lower values of TDI, while 
those from Eastern Europe/Turkey and Rest-of- World 
regions had significantly higher values. When covariates 
representative of patients who were less severe were 
inputted into the model (i.e., GOLD 2, no ICS and base- 
line FEVi of 1.595 1), the model-predicted TDI for a 
zero and +100 ml change in FEVi was 1.98 and 2.44, 
respectively. For more severe patients (i.e., GOLD 3, ICS 
and baseline FEVi of 0.95 1), the model-predicted TDI 
for patients with zero and +100 ml change in ¥EV 1 was 
-0.20 and 0.26, respectively. 

There was a significant correlation between ASGRQ 
and AFEVi (P < 0.0001). As with TDI, the slope of the 
relationship between ASGRQ and AFEVi was 



Table 2 Summary information for averages of AFEv^ 



Number of observations in intervals defined by ml ranges Percentiles* of observations 



Average of 
AFEVt 


n 


< 

-500 


-500, 
-50 


-50, 
50 


50, 
150 


150, 
250 


250, 
500 


> 

500 


Minimum 


5% 


25% 


50% 


75% 


95% 


Maximum 


Week 4-12 


3313 


24 


623 


695 


717 


563 


554 


137 


-1180 


-198 


-20 


94 


220 


466 


1966 


Week 4-24/26 


2389 


16 


478 


476 


550 


388 


377 


104 


-1148 


-203 


-30 


88 


214 


474 


1782 


Week 4-52 


1169 


6 


292 


218 


273 


165 


168 


47 


-755 


-223 


-52 


75 


201 


464 


1607 


On treatment 


3313 


24 


708 


662 


751 


536 


501 


131 


-1180 


-213 


-32 


82 


208 


467 


1966 



*For example, when averaging over time on treatment, the minimum average AFEVt was -1180 ml and the maximum was 1966 ml; between these points, 5% 
observations were less than or equal to -213 ml and half were less than or equal to 82 ml; AFEVt, change from baseline in trough forced expiratory volume in 1 
second. 
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Table 3 Outcome means by average AFEVq category, P values for associations between average AFEVq and outcome, 
and correlations at individual and cohort levels 



Average 


Category 


vviinarawai 


Tni at 

i u\ at 


Tni at 

i u\ at 


Tni at 

i u\ at 








Rescue 


Exacerbation 


Severe 


AFEVt (ml) 


midpoint 


rate* (% 


12 


24/26 


52 


at 12 


at 24/ 


at 52 


medication 


rate (per 


exacerbation 




value of 


patients) 


weeks 


weeks 


weeks 


weeks 


26 


weeks 


mean puffs per 


year) (n = 


rate (per 




AFEVt 




(n = 


(n = 


(n = 


(n = 


weeks 


(n = 


day (over study 


3158) 


year) (n = 




(ml) 




2781) 


2208) 


1099) 


3141) 


(n = 


1115) 


duration) (n = 




3158) 
















2215) 




3158) 






-500, -50 


-275 


11.2 


1.44 


1.57 


1.24 


-3.15 


-4.70 


-2.21 


2.46 


0.63 


0.059 


-50, 50 


0 


9.0 


1.31 


1.39 


1.92 


-3.17 


-3.81 


-3.03 


2.57 


0.58 


0.065 


50, 150 


100 


10.1 


1.79 


1.97 


1.65 


-3.84 


-4.74 


-4.22 


2.10 


0.61 


0.057 


150, 250 


200 


10.2 


2.12 


2.23 


2.23 


-5.84 


-6.34 


-6.70 


1.80 


0.51 


0.048 


250, 500 


375 


6.7 


2.68 


3.03 


3.27 


-7.38 


-7.29 


-9.06 


1.66 


0.38 


0.021 



P value < 0.001 < 0.001 < 0.001 < 0.001 < 0.001 < 0.001 < 0.001 < 0.001 0.1 

(Kruskal- 

Wallis) 



Correlation, 

individual 

level 


0.15 


0.14 


0.18 


-0.12 


-0.07 


-0.16 


-0.11 


-0.06 


-0.03 


Correlation, 
cohort level 


0.90 


0.88 


0.92 


-0.90 


-0.79 


-0.95 


-0.88 


-0.89 


-0.81 



^Non-completers in all studies categorised by AFEVt at Day 2; AFEVt, change from baseline in trough forced expiratory volume in 1 second; TDI, Transition 
Dyspnoea Index; ASGRQ, change from baseline in St George's Respiratory Questionnaire 



significantly shallower for negative AYEVi (P = 0.002 for 
the difference between slopes). The slope of the rela- 
tionship with improvement in FEVi was not significantly 
affected by treatment, ICS use, or world region, but it 
was steeper for patients in GOLD 3, and with baseline 
FEVi 0.95 1 compared with GOLD 2 and baseline FEVi 
1.595 1 (P = 0.004). For an increase of AFEVi of 100 ml, 
the model predicted a change in SGRQ of -1.3 for 
GOLD 2 and -1.9 for GOLD 3 patients at Week 24/26. 
Patients with worse baseline FEV!, with worse baseline 
SGRQ, using ICS or on placebo, had significantly higher 
ASGRQ, whereas patients from Eastern Europe/Turkey 
and Rest-of- World regions had significantly lower 
ASGRQ at Week 24/26. For GOLD 2 patients, who had 
used no ICS and had baseline SGRQ of < 31, the 
model-predicted improvement in SGRQ at Week 26 for 
a zero and +100 ml change in ¥EV 1 was -1.6 and -2.9, 
respectively. Similarly, for GOLD 3 patients who had 
used ICS and had baseline SGRQ of > 58, the model- 
predicted improvement in SGRQ at Week 24/26 was 
-0.9 and -2.8, respectively. 

AFEVi was significantly correlated with rescue medi- 
cation use (P < 0.0001). Treatment, baseline severity, 
ICS use or world region, did not significantly affect the 
slope of the relationship, and the slope did not change 
significantly between negative and positive AFEV!. 
Hence, for all combinations of covariates, an increase of 
100 ml in A¥EV 1 is predicted to yield the same 10% 
reduction in rescue medication use. Patients with lower 
baseline FEVi, male patients, those with higher baseline 
medication usage or more severe disease, using ICS or 
on placebo or tiotropium, had significantly higher rates 



of rescue medication usage. Younger patients (< 65 
years) had almost significantly higher rates (P = 0.012, 
versus the defined significance level of P < 0.01). For 
GOLD 2 patients not receiving ICS, the predicted daily 
number of puffs of rescue medication for a zero and 
+ 100 ml AFEVi was 0.89 and 0.80, respectively, and 
1.83 and 1.64 for those in GOLD 3 and using ICS. 

AFEVi was significantly correlated with exacerbations 
(P = 0.002). Treatment, baseline severity, ICS use or 
world region, did not significantly affect the slope of the 
relationship. Furthermore, the slope did not change sig- 
nificantly between negative and positive AFEV!. Hence, 
for all combinations of covariates, an increase of 100 ml 
in AFEVi is predicted to yield the same 12% decrease in 
exacerbations. Patients with lower baseline FEV\ and 
patients using ICS had significantly higher rates of 
exacerbations. Patients from the Eastern Europe/Turkey 
region had significantly lower rates of exacerbations. 
The model estimate for the annual rate of exacerbations 
for patients with a zero and +100 ml AFEVi were 0.29 
and 0.25, respectively, for GOLD 2 patients not using 
ICS; and 1.28 and 1.12, respectively, for patients in 
GOLD 3 and using ICS. As in the summary analysis, the 
rate of severe exacerbations requiring hospitalisation was 
not significantly correlated with AFEVi (P = 0.3). 

Discussion 

Our analyses show that improvement in FEVi is signifi- 
cantly related to changes in the patient-reported out- 
comes TDI, SGRQ, exacerbation rate and rescue 
medication use over 12-52 weeks of treatment. These 
relationships were significant at both an individual and 
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Figure 1 Outcome means in AFEV-, categories versus category midpoint value. Plots show data with Loess smooth curves superimposed. 
AFEV 1; change from baseline in trough forced expiratory volume in 1 second; TDI, Transition Dyspnoea Index; ASGRQ, change from baseline in 
St George's Respiratory Questionnaire. 



population level, although correlations were much stron- 
ger in the population-based analyses. 

Few studies have examined the relationship between 
change in FEVi and change in outcomes. However, our 
results are consistent with analyses of patients from the 
3-year EUROSCOP (The European Respiratory Society 
Study on Chronic Obstructive Pulmonary disease) study, 
in which an improvement of 100 ml in TEV 1 was asso- 
ciated with a 4% reduction in dyspnoea in males [13], 



and a 16-week clinical study, in which a significant, but 
weak correlation between change in YEVi and change in 
SGRQ score was demonstrated (r = 0.33, P = 0.001) 
[14]. Further, a recent systematic review of 22 studies 
found that 100 ml increase in FEVi was associated with 
a statistically significant reduction in SGRQ of 2.5 [15]. 
However, to our knowledge, the current analysis is the 
largest and most comprehensive to investigate the corre- 
lation between change in FEVi and change in outcomes 
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Table 4 Responder rates* for TDI + and ASGRQ + by average AFEVt category 



Average 


Category 


TDI at 12 


TDI at 24/26 


TDI at 52 


ASGRQ at 12 


ASGRQ at 24/26 


ASGRQ at 52 


AFEVt 


midpoint 


weeks % 


weeks % 


weeks % 


weeks % 


weeks % 


weeks % 


(ml) 


value of AFEVt 


responders (n 


responders (n = 


responders (n 


responders (n = 


responders (n = 


responders (n = 




(ml) 


= 2781) 


2208) 


= 1099) 


3141) 


2215) 


1115) 


-500, -50 


-275 


50 


51 


45 


42 


49 


41 


-50, 50 


0 


48 


49 


53 


46 


45 


45 


50, 150 


100 


54 


57 


50 


48 


48 


49 


150, 250 


200 


59 


60 


58 


53 


59 


54 


250, 500 


375 


66 


69 


69 


56 


57 


65 



^Responders were patients who achieved at least the minimal clinically important difference (MOD; one and four units for TDI and ASGRQ, respectively); TDI, 
Transition Dyspnoea Index; ASGRQ, change from baseline in St George's Respiratory Questionnaire; AFEV l7 change from baseline in trough forced expiratory 
volume in 1 second. 

+ For the outcomes, each row contains approximately 20% of the column's indicated n, as in the left half of Table 2 



using individual patient data from studies of similar 
design. This provides a relatively homogeneous popula- 
tion for analysis, compared with study-level meta- 
analyses. 

We demonstrated that a 100 ml increase in trough 
FEVi (a magnitude of change with perceptible effects 
[25]) was associated with a 0.46-unit increase in TDI 
and a 1.3- to 1.9-unit improvement in SGRQ after a 24/ 
26-week treatment period and, over 12-52 weeks of 
treatment, a 10% decrease in daily rescue medication 
use and a 12% decrease in the annual exacerbation rate. 
In general, we found that treatment, baseline severity, 
concomitant ICS use and world region, did not affect 
the slope of the relationship between outcome and 
change in FEV^ except for ASGRQ where more severe 
COPD, as characterised by a lower FEVi and a higher 
SGRQ at baseline, was associated with a steeper slope, 
compared with less severe COPD. This is consistent 
with results from the 3-year TORCH (TOwards a Revo- 
lution in COPD Health) study, in which trends to 
greater improvement in SGRQ with worsening GOLD 
severity were noted with active treatments [26]. 

Although severe exacerbations showed a trend toward 
greater reductions with increasing AFEV^ the relation- 
ship was not statistically significant. While the observed 
12% reduction in overall exacerbation rate for an 
improvement of 100 ml in YEVi was comparable with 
previously published data [11], the studies included in 
our analysis were not powered to show an effect on 
exacerbations, and did not specifically recruit patients at 
risk of exacerbations. 

We found inconsistent effects of different treatments 
across individual outcomes, perhaps due to patient 
numbers in sub-categories being too low to demonstrate 
consistent differences for individual treatments across 
all outcomes. However, our analysis did demonstrate 
that the relationship between AYEVi and outcome 
appeared to be the same, regardless of treatment arm. 
Similarly, baseline severity, ICS use and world region 
were assessed as main effects, as well as for their 



potential influence on the effect of AFEVi. Although 
numbers of patients in GOLD 4 (as well as GOLD 1) 
were too small to make any inferences, patients predo- 
minantly in GOLD 3 at baseline, and those using ICS, 
consistently exhibited significantly worse outcomes. 
Indeed, the variability in baseline severity and ICS use 
are likely to have been major contributors to the large 
variability in observed outcomes. 

The relationships between outcomes and AFEV! may 
differ between negative and positive AFEVl, and for this 
reason, the models included a possible breakpoint at 
zero in the relationship slope. The inclusion of this 
breakpoint was found to be significant for TDI and 
ASGRQ, suggesting that baseline severity and other 
included covariates could not explain the observed 
behaviour fully. These results may have been influenced 
by differences in withdrawal rates between categories 
[27], since the highest withdrawal rate was in those with 
a negative change in FEV^ although differences between 
groups were minimal. The inclusion of a breakpoint was 
not significant for rescue medication and exacerbations, 
even though Figure 1 may have anticipated its impor- 
tance, especially for exacerbations. The large variability 
and count nature of the data for rescue medication and 
exacerbations may have caused 'Type-2' statistical errors, 
i.e., failure to find the true breakpoints to be significant. 

We found that zero change in FEVi was associated 
with significant positive improvements in TDI and 
SGRQ. Additionally, while a greater proportion of 
patients achieved the MCID for TDI and SGRQ as 
AFEV! increased, our results indicated that as many as 
50% patients responded, irrespective of AFEVi, possibly 
an effect of clinical trial participation seen consistently 
in the placebo limb of clinical trials [28-30]. 

We constructed the models in our analysis using 
AFEVi as a predictor, and the other outcome measures 
as the response variables, based on the results of a care- 
fully-controlled series of clinical trials. However, AFEVi 
was as much a response as was the outcome, so AFEVi 
was not an 'independent' variable controlled as part of 
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Figure 2 Outcomes versus AFEV-,: curves fitted from model-based analysis. SGRQ and TDI data is for Week 24/26. Exacerbations are 
reported counts normalised to 1 year. Rescue medication use is reported numbers of puffs normalised to 1 day. For plotting predicted curves, 
'moderate' refers to baseline FEVt 1.595 I (third quartile of observed values) and GOLD 2 (moderate or less [19]); 'severe' refers to baseline FEVt 
0.95 I (first quartile of observed values) and GOLD 3 (severe or greater [19]). For TDI, 'moderate' and 'severe' refer to BDI at quartiles 2 and 1, 
respectively. For ASGRQ, 'moderate' and 'severe' refer to baseline SGRQ at quartiles 31.3 and 58.13, respectively; AFEV 1; change from baseline in 
trough forced expiratory volume in 1 second; ASGRQ, change from baseline in St George's Respiratory Questionnaire; TDI, Transition Dyspnoea 
Index; GOLD, Global initiative for chronic Obstructive Lung Disease; BDI, Baseline Dyspnoea Index; ICS, inhaled corticosteroid. 
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the experimental design. There may have been further 
confounders that simultaneously affected how both 
AFEVx and the outcome responded to treatment. The 
fitted models therefore describe the observed relation- 
ships under the conditions of a clinical trial, but do not 
provide a definitive answer as to whether there is a cau- 
sal relationship between AFEVi and the outcomes. 

The studies included in our analysis were powered on 
the spirometric endpoint ¥EV 1} which is required by 
regulatory agencies for the approval of bronchodilators, 
and is included in the majority of treatment guidelines. 
For this reason we made FEVi the focus of our analysis. 
Other physiological parameters such as inspiratory capa- 
city may have stronger correlations with dyspnoea [31]. 
However data for these parameters were not available 
from our dataset and further research is needed to 
investigate such correlations in large numbers of 
patients. 

Conclusions 

It is commonly stated that spirometry does not fully 
capture the impact of COPD on a patient's health [32], 
Our analysis of a large cohort of patients has demon- 
strated that in individual subjects, change in FEVi is a 
significant, albeit relatively weak predictor of improve- 
ment in patient-reported outcomes. However, the cur- 
rent analysis also shows that, at a population level, 
improvements in FEVi with long-acting bronchodilator 
therapy are strongly correlated with improvements in 
dyspnoea, health status and exacerbations. This suggests 
that interventions which significantly improve ¥EV 1 are 
also likely to be associated with improved clinical and 
patient-reported outcomes. 
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